CERN Tape Archive: a distributed, reliable and scalable scheduling system

نویسندگان

چکیده

The CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of LHC experiments at CERN. Magnetic storage offers lowest cost per unit volume today, followed by hard disks and flash. In addition, current drives deliver solid bandwidth (typically 360MB/s device), but high latencies, both for mounting drive positioning when accessing non-adjacent files. As consequence, transfer scheduler should queue requests before warranting mount reached. spite these user-interactive operations have low latency. scheduling system CTA was built from experience gained CASTOR. Its implementation ensures reliability predictable performance, while simplifying development deployment. expected be used long time, lock-in vendors or technologies minimized. Finally, quality assurance were put place validate performance allowing fast safe turnaround.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proposal for a Web-Based Lecture Archive System for CERN

This document outlines a proposal for the development of an electronic archive system for lectures, tutorials and other slide-based presentations at CERN. The program is based on the implementation of existing software designed to synchronize electronic slide presentations with audio/video recordings through an integrated web interface featuring full indexing and controlled playback. The system...

متن کامل

CScale - A Programming Model for Scalable and Reliable Distributed Applications

Today’s connected world demands applications that are responsive, always available, and can service a large number of users. However, the task of writing such applications is daunting, even for experienced developers. We propose CScale, a programming model that attempts to simplify this task. The objective of CScale is to let programmers specify their application’s core logic declaratively with...

متن کامل

Scaling a Reliable Distributed System

We consider the problem of reliably connecting an arbitrarily large set of computers (nodes) with communication channels. Reliability means here the ability, for any two nodes, to remain connected (i.e., their ability to communicate) with probability at least μ, despite the very fact that every other node or channel has an independent probability λ of failing. A simple solution to the problem c...

متن کامل

TaskMaster: A Scalable, Reliable Queuing Infrastructure for Building Distributed Systems

TaskMaster is a system for managing priority-ordered queues that is designed to scale to 1 billion tasks across 100 thousand queues per node. A reliable queuing system, such as TaskMaster, provides a mechanism for distributing units of inherently serial work (tasks) to workers. Priorities are lexicographically ordered strings that give users more power than FIFO or fixed-range integer prioritie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Epj Web of Conferences

سال: 2021

ISSN: ['2101-6275', '2100-014X']

DOI: https://doi.org/10.1051/epjconf/202125102037